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ABSTRACT 

The existence of massive galaxies with strongly suppressed star formation at z '^ 2.3, identified in a 
previous paper, suggests that a red sequence may already be in place beyond z — 2. In order to test 
this hypothesis, we study the rest-frame U — B color distribution of massive galaxies at 2 < z < 3. 
The sample is drawn from our near-infrared spectroscopic survey for massive galaxies. The color 
distribution shows a statistically significant (> 3cr) red sequence, which hosts ~ 60% of the stellar 
mass at the high-mass end. The red-sequence galaxies have little or no ongoing star formation, as 
inferred from both emission-line diagnostics and stellar continuum shapes. Their strong Balmer breaks 
and their location in the rest-frame U — B, B — V plane indicate that they are in a post-starburst phase, 
with typical ages of ~0.5-1.0Gyr. In order to study the evolution of the red sequence, we compare 
our sample with spectroscopic massive galaxy samples at 0.02 < z < 0.045 and 0.6 < z < 1.0. The 
rest-frame U — B color reddens by ~ 0.25 mag from z ^ 2.3 to the present at a given mass. Over 
the same redshift interval, the number and stellar mass density on the high-mass end (> 10^^ -^0) of 
the red sequence grow by factors of ^ 8 and ~ 6, respectively. We explore simple models to explain 
the observed evolution. Passive evolution models predict too strong A(C/ — B), and produce 2: '^ 
galaxies that are too red. More complicated models that include aging, galaxy transformations, and 
red mergers can explain both the number density and color evolution of the massive end of the red 
sequence between z '^ 2.3 and the present. 
Subject headings: galaxies: evolution — galaxies: formation — galaxies: high-redshift 



1. INTRODUCTION 

Early type galaxies with quiescent stellar populations 
form a well-defined color-magnitude or color-mass rela- 
tion at z '^ 0, known as the red sequence. They are 
clearly separated from blue star- forming galaxies, which 
populate a different, less-tight sequence, called the blue 
cloud. While the red sequence is primarily build up of 
massive galaxi es, blue ga laxies have lower stellar masses 
(e.g, [KauffmaniLeLaljSoOS). 

The appearance and evolution of the red se- 
quence provide a powerful method to study the star- 
formation and assernbly history of massive galax- 
ies fe.g ■ iBower et al.l Ili 92|; ISchweizer fc Seitzeil1l99l : 
Ivan Dokkum et ahl Il998l ). The red sequence exhibits 
a tilt and spread which are thought to be primarily 
drive n by metalli c ity and age differences, respectively 
(e^. lFabeJ]]J7a _ rWo"rthevl 119941 : iKodama fc Arimotol 
ll997l : lKodama et al.l[l999( l. Both the shape and the color 
of the red sequence evolve over cosmic time. Several pro- 
cesses are responsible for this evolution. First, the color 
gradually reddens due to aging of stellar populations. 
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Second, the red sequence grows through transformations 
of blue galaxies. These transformations change the mix 
of properties of red-sequence galaxies and may cause the 
observed evolution of the red sequence to deviate from 
the expectations from passive evolution. Third, merg- 
ers among red-sequence galaxies change the red-galaxy 
mass function and may affect the color, slope, a nd scat- 
ter of the red sequence (e.g.. iBower et al.lll992f ). Thus, 
the evolution of the color, the shape, the number and 
mass density of the red sequence sets direct constraints 
on the assembly and star formation history of massive, 
early- type galaxies. 

The evolution of the red sequence between z r^ 1 and 
z '^ has extensively been studied for this purpose. 
Overall, these studies find that the color evolution is con- 
sistent with passive evolution (e.g.. lBell et al.l[2004[ ). the 
mass on the red sequence do ubles in this redshift interval 
(eg^ iBell et al, 2004 ; Faber et all l2007l : lArnouts et aP 
l2007t l. and the growth at higher masses is attributed 
to both red merg ers and galaxy transformations (e.g., 
iBundv et al.l 120071 ). As a significant part of the red se- 
quence was already in place at z ~ 1, we have to push 
our studies to higher redshift to trace the onset and first 
build up of the red sequence. 

Recent high-redshift studies rep ort the detection of 
the red sequence up to z = 2 (e.g.. lArnouts et al]l2007t 
ICassata et al.ll2008[ ). Moreover, developments in NIR in- 
strumentation have enabled the first spectroscopic confir- 
mations of quiescent galaxies without d etected emission 
lines beyond z = 2 (jKriek et al.ll2006ai rbr). In particular 
the cross-dispersed mo de of the Gemin i Near-Infrared 
Spectrograph (GNIRS. lEJias et al.l[2006h . with a wave- 
length coverage of 1-2.5 /xm allows systematic studies 
of massive galaxies at z ^ 2.3. Using this instrument 
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we have completed a NIR spectroscopic study of 36 K- 
selected galaxies at 2 < z < 3. In this paper we use 
this survey to study the onset and color evolution of the 
red sequence. The spectroscopic redshifts in combina- 
tion with the accurate continuum shapes as provided by 
the NIR spectra, allow for the first time accurate rest- 
frame color determinations of quiescent, massive galaxies 
beyond z = 2. 

Throughout the paper we assume a ACDM cosmology 
with »m = 0.3 fU = 0.7, and Hq = 70 km s~^ Mpc~\ 
and a ISalpeten (jl955[ ) initial mass function (IMF) be- 
tween 0.1 and 100 Mq. All broadband magnitudes are 
given in the Vega-based photometric system. 

2. DATA 

The data used in this work are extracted fr om our NIR 
spect roscopic survey for massive galaxies (jKriek et al.l 
|2008| 1. The full sample consists of 36 X-bright galax- 
ies observed with GNIRS in cross-dispersed mode (1.0- 
2.5 /im), between 2004 September and 2007 March (pro- 
grams: GS-2004B-Q-38, GS-2005A-Q-20, GS-2005B- 
C-12, GS-2006A-C-6, GS-2006B-C-5 and GS-2007A- 
C-9). The galaxies were originally selected from 
the multi-waveleng t h survey by Yale- Chile (MUSYC, 
iGawiser et al.l[2006l : iQuadri et al.ll2007l ). which provides 
us with accurate optical-to-NIR (UBVRIzJHK) pho- 
tometry 



In iKriek et al.l ()2008l ) we show that distribution of the 
rest-frame U — V and observed R — K colors of our 
spectroscopic sample are representative of a mass lim- 
ited sample at 2 < z < 3. However, we do note that we 
are biased towards galaxies with brighter i^-band magni- 
tudes. Although this mainly reflects the relatively lower 
redshifts of these galaxies, as explained in detail in Krick 
et al. 2008, we might be missing galaxies with higher 
mass-to- light ratios (M/L). Further details about sam- 
ple completeness, observations, red uction and e x tracti on 
of the spectra can also be found in ' Kriek et al.l ()2008[ ). 

For this work we use the 28 galaxies within the range 



2<Zs 



< 3. Stellar masses and other population prop- 



erties are derived by stellar population mod eling as de- 
scribed in det ail inlKriek et all (l2006aLl2008l ). and given 



in Table 2 in iKriek et al.l ( 2008( 1 . In summary, we fit 



the spectra together with the broad band optical pho- 
tometry bv'Bruzu al fc CharlotI ([2003) stellar population 
models, assuming an exponentially declining star forma- 
tion history, solar met allicity, the ICalzetti et al.l (|2000l ) 
reddening law and the lSalpeterl (|1955D IMF between 0.1 
and 100 Mq. We allow a grid of 41 values for Ay be- 
tween and 4 mag, 31 values for the characteristic star- 
forming timescale (r) between 10 Myr and 10 Gyr, and 
24 values for age (not exceeding the age of the universe) . 
Uncertainties on the stellar population properties are de- 
rived using 2 00 Monte Ca rlo simulations as described in 
IKriek et al .1 (2006a, "2008). We leave redshift as a free 
parameter for galaxies without emission lines. We tested 
the continuum redshifts using the emission-line galaxies 
in our sample, an d found an u ncertainty in Az/(1 -I- z) 
of less than 0.019 (jKriek et al.| [2008). 

Rest-frame U — B^ colors are also determined from the 
best-fit stellar population models. In the same fashion 
as for the stellar population properties, confidence lev- 

"^ Throughout this paper we use the Buser U, B3 and V filters. 



els are derived from Monte Carlo simulations. Hence, 
the confidence intervals on the rest-frame colors include 
the uncertainties on the continuum redshifts for galax- 
ies without emission lines. The colors are not measured 
directly from the spectra, as for several galaxies the U- 
band is not covered completely by the NIR spectrum, 
or the U-band falls partly in between the J and H at- 
mospheric windows. As our grid allows almost 30 000 
different synthetic spectra, we do not expect the colors 
to converge to certain best-fit templates. Nevertheless, in 
order to examine whether using best fits may introduce 
systematics in the derived rest-frame colors, we directly 
measure the colors from the NIR spectra in combination 
with the optical broadband photometry. We find no sys- 
tematic offset between the direct colors and those derived 
from the best fits. 

The unique aspects of this data set are the accurate 
spectroscopic redshifts, rest-frame colors and masses. 
Although broadband photometric studies provide much 
larger galaxy samples, they lack the accuracy needed for 
this study. 

3. A RED SEQUENCE AT ^ ~ 2.3 

3.1. The Detection of the Red Sequence at z ^ 2.3 

Figures [1^ and b present rest-frame U — B color versus 
stellar mass for the 2 < z < 3 massive galaxy sample. 
The colors are corrected for redshift differences within 
the sample, as will be explained in § 14.21 These cor- 
rection are very small (^^0.001), and barely change the 
appearance of this plot. The distribution of galaxies in 
Figures [1^ and b is striking, as there are many galaxies 
with similar, red colors. In order to test whether a red se- 
quence was already in place at this early epoch, we exam- 
ine the rest-frame color distribution. But first we correct 
the colors for the tilt of the re d sequence, by assumin g 
the slope of z = red sequence ()van der Wei et all2007D . 
The applied correction has the form 

[U - B)m = {U -B)- 0.08 (logM/Me - 11.3) (1) 

The residuals are shown by the black histograms in Fig- 
ures llJ; and d. The distribution exhibits a conspicuous 
peak at {U — B) ^ 0.25 mag. We test the significance of 
the red sequence by calculating the probability to obtain 
this peak when assuming a flat distribution in rest-frame 
U — B color. If the true distribution is uniform the prob- 
ability of finding nine or more galaxies in any one of the 
three red bins and finding 16 or more galaxies in total 
with U - B > 0.1 is less than 0.001. Thus the change of 
finding the detected red sequence by change is less than 
0.1%. This implies that the detection of the red sequence 
is significant at the > 3cr level. This all strongly suggests 
that a red sequence of massive field galaxies was most 
likely already in place at z ~ 2.3. 

We determine the location of the peak of the red se- 
quence by fitting a Gaussian to the color distribution. 
We average over many binning positions to obtain a dis- 
tribution that is not affected by the particular choice of 
binning. In order to avoid including blue-cloud galax- 
ies, we restrict the fitting region to all galaxies with 
([/ — B)m > {U — B)pcak — 0.1. Hence, this procedures 
requires a few iterations. The cut-off value of 0.1 mag 
is chosen as it is twice the scatter in rest-frame U — B 
color of the local red sequence (see § 14. 2p . The best fit 
is indicated by the solid curve in Figures [TJ; and d. The 
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Fig. 1. — Rest-frame U — B vs. stellar mass {top panels) and the color distribution at 2 X 10 Mq along the z ~ 0.0 slope {bottom 
panels) for the 2 < z < 3 massive galaxy sample. In the bottom panels we show only galaxies more massive than 10^^ Mq (to the right of 
the dashed line in top panels). The black histograms in the bottom panels show a significant peak (> Scr), indicating that a red sequence 
was already in place at z ~ 2.3. The solid curve in the bottom panels represents the best fit to the color distribution of the red-sequence 
galaxies. The resulting location of the red sequence is indicated by the solid line in the top panels. All galaxies above the dotted line in 
the top panels are defined as red-sequence galaxies in this work {{U — B)m > {U — i3)pcak ~ 0.1). The left and right panels illustrate 
the properties of the galaxies according to SED modeling and emission line diagnostics, respectively. The symbols in panel a indicate the 
best-fit specific SFRs. In panel b the different symbols s how if emission lin es are detected for the galaxies, and whether the emission lines 
are dominated by star formation or by AGN activity (see lKriek et al.ll20071) . The corresponding color distributions are presented in panel c 
and d by the matching colors. The average Itr confidence interval is given in the top loft of the top panels. Both independent star formation 
indicators imply that the red sequence at z ~ 2.3 is dominated by galaxies with little or no ongoing star formation. 



peak of the distribution is shown by the sohd orange hne 
in Figures [1^ and b. All galaxies above the dotted line 
{[U—B]m > [C/—i3]pcak— 0.1) are defined as red-sequence 
galaxies from hereon. 

Our result may seem in disagreement with previous 
studies, some of which indicate that the red sequenc e 
disappears beyond z = 1.5 (e.g.. ICirasuolo et al.ll2007f ). 
However, finding a red sequence, as is well known, re- 
quires very accurate rest-frame color determinations. 
The use of a galaxy sample with spectroscopic redshifts 
and stellar continuum shapes - and not just photomet- 
ric information - enables us to detect a significant red 



sequence beyond z = 2, in contrast to previous studies. 
Broadband photometry in combination with photometric 
redshifts with errors of Az/(1 + z) ^ 0.07 gives random 
errors of 0.1 mag in rest-frame U — B, and systematic 
errors may play an even larger role. Furthermore, the 
typical uncertainties on stellar mass and absolute mag- 
nitude are a facto r of '^ 2 and ^0.4 mag, respectively 
(jKriek et al.l 120081 ). Thus, the uncertainties on the lo- 
cation of the individual red-sequence galaxies are larger 
than the width of the intrinsic red sequence. This im- 
plies that photometric studies with errors of ~ 0.07 in 
Az/(1 -|- z) are not able to recover a red sequence. Stud- 
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ies that lack spe ctroscopic redshifts in the relevant red- 
shift range (e.g.. lCirasuolo et al.ll2007f ) to calibrate their 
photometric redshifts, most likely have even larger errors 
in Az/(l-|-z). 

Whereas we do detect a significant red sequence, we 
find no bimodality in the galaxy distribution. We can- 
not reliably comment on bimodality since larger samples 
over a larger mass range are needed to assess whether the 
blue galaxies are distinct fro m the red ones at this epoch. 
Nonetheless, recent work bv lCassata et al.l (|2008[ ). based 
on a spectroscopic study over a larger stellar mass range, 
shows that the galaxy bimodality exist at least out 
to a redshift of z = 2. We note that the study by 
ICassata et al.l ()2008[) is based on spectroscopic redshifts, 
reinforcing our hypothesis that the current result can 
only be obtained with redshifts more accurate than the 
standard photometric redshifts. 

Finally, we stress that the detection of a red sequence 
is independent on whether the sample is fully represen- 
tative of a mass-limited sample. In this context it is 
interesting to note that the red sequence has originally 
been discovered and mainly studied in magnitude- and 
not mass-limited samples. 

3.2. Properties of Red- Sequence Galaxies at z ^ 2.3 

In the previous section we showed that a red sequence 
was already in place dX z ^ 2.3. The well-defined shape 
of the red sequence could be a consequence of the con- 
verging colors of evolved galaxies. However, our sam- 
ple is small, and dusty starburst galaxies, known to 
be highly abundant at thes e redshifts (e.g.. Webb et al] 
l2006t iPapovich et al.ll2006l ). may contribute to, or even 
dominate the red sequence at these early epochs. In or- 
der to test whether these red-sequence galaxies indeed 
host quiescent stellar population, we examine the star 
formation properties using several diagnostics. 

In Figure[T^ the galaxies are coded following their best- 
fit specific SFR d erived from model ing their stellar con- 
tinua (see §[5] and iKriek et"aLll2008f ). The corresponding 
color distributions are presented in Figure [TJ;. Most red- 
sequence galaxies are best-fit by specific SFRs less than 
0.05 Gyr~^, and three have specific SFRs between 0.05 
and 0.5 Gyr~^. The uncertainties o n the specific SFR s 
are about a factor of ^ 3 on average (jKriek et al.ll2008[ ). 
Nevertheless, the large fraction of galaxies with low spe- 
cific SFRs suggests that red sequence is not dominated 
by dusty starbursts. 

A combination of two rest-frame colors, such that one 
isolates the optical break, and the other color measures 
the slope of the spectrum redwards of the optical break 
(e.g., Forster Schrcibcr et al. 2004; Labbc ct al. 2005; 
IWuvts et al.ll2007f l. may also be used to discriminate be- 
tween dusty star-forming galaxies and quiescent stellar 
populations. In Figure[2]we show rest-frame U—B versus 
B —V for all galaxies at 2 < z < 3. Color evolution tracks 
of lBruzual fc CharlotI (|2003f ) models show that quiescent 
stellar populations have a different locus than dusty star- 
bursts. As expected, the red-sequence galaxies are closer 
to the simple stellar population (SSP) model tracks. The 
remaining galaxies can roughly be divided in those that 
have colors more comparable to constant star forming 
(CSF) models with dust, and galaxies that will probably 
soon join the red sequence. 

These results are supported by independent emission 
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Fig. 2.— Rest-frame U - B versus B - y for the 2 < ^ < 3 
massive galaxy sample. The symbols indicate the emission-line 
diagnostics, identical to those in Figure [TJi. The colors indicate 
the best-fit specific SFRs similar as the color coding in Figure [T^. 
The red-sequence galaxies are indicated by the orange open circles. 
The average 1 a confidence interval is given i n the top left. The 
black curves show the color evolution tracks of lBruzual &: Charloti 
II2003I ) models for an SSP {dashed line), an exponentially declining 
model with a t of 300 Myr {dotted line), and a CSF model {solid 
line), all for solar metallicity. The dashed, gray curve represents 
an SSP model with Z = 2.5Zq. Ages in Gyr are indicated along 
the tracks. The vect or indicates a reddening of Ay = 1 mag for a 
ICalzetti et al.l I I2000I) law. The B -V colors imply that the red- 
sequence galaxies are in a post-starburst phase. 

line diagnostics, presented in Figure [TJd and d. We di- 
vide the sample according to whether emission lines are 
detected in the rest-frame optical spectra. Subsequently, 
the emission-line galaxies are sorted for the dominant 
origin of their line emission: using primarily emission- 
line ratios we discriminate between active gal actic nuclei 
(AGNs) and Hii regions (jKrieket al.ll2007f) . For two 
galaxies we have no information on the line emission as 
the lines are expected at wavelengths with low atmo- 
spheric transmission. 13 out of 15 red-sequence galaxies 
have no detected emission lines, or the line emission is 
dominated by AGNs. 

Thus, both the stellar continua and the emission line 
diagnostics suggest that the red sequence at z ^ 2.3 
is dominated by galaxies with quiescent stellar popula- 
tions. In particular, 7 out of 9 previously identified galax- 
ies with strongly su ppressed star formation presented in 
iKriek et al.l (|2006b( ) fall on this red sequence. The rest- 
frame U — B colors of the two remaining galaxies with 
strongly suppressed star formation are just below the cut- 
off value, and their locus in Figure [5] is near the SSP 
track. Thus, they will most likely soon join the red se- 
quence. 

The rest-frame B — V colors in combination with U — B 
indicate that the red-sequence galaxies at z ~ 2.3 are 
likely in a post-starburst phase (Figure [2]) . This is fur- 
ther illustrated in Figure [3l in which we show the stacked 
low-resolution spectrum of all red-sequence galaxies. For 
comparison we show a 2 Gyr SSP model with a promi- 
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Fig. 3. — Mean stack of the low-resolution GNIRS spectra of 
the red-sequence galaxies at 2 ~ 2.3 (black curve). Overplotted is 
the mean of all best fits to the spectra in blue. For comparison we 
show a 2 Gyr SSP model in red. This figure shows that in contrast 
to the 2 Gyr model, the optical break of the stacked spectrum is 
dominated by the Balmer break, typical for post-starburst galaxies. 
This may imply that the red sequence has just been starting to 
build up at 2 ~ 2.3. 



nent 4000 A break as well. In contrast to such old stel- 
lar populations, the optical break for the z ^ 2.3 red- 
sequence galaxies is clearly dominated by the Balmer 
break. Overall, our findings may imply that the mas- 
sive end of the red sequence is just starting to build 
up at z ~ 2.3, and was like ly not yet in place beyond 
z ~ 3. iKodama et al.l (|2007| ) drew the same conclusion 
by studying the stellar populations in protoclusters at 
2 5, z < 3. Our work is also consistent with the study 
by iBrammer fc van DokkurnI (|2007) who found that in 
contrast to z ~ 2.4, red galaxies at z ~ 3.7 have sig- 
nificant UV emission and are thus still actively forming 
stars. We note, however, that as explained in § 2, we 
might be missing galaxies with higher M/L, as these are 
relatively faint in K. 

Furthermore, Figure [2] provides us with a clear illustra- 
tion of the different processes that may be responsible for 
the spread and the tilt of the red sequence at z ^ 2.3. 
U — B and B — V show a positive correlation for the 
red-sequence galaxies. The typical la confidence con- 
tour shows that random errors can not fully account for 
the spread, and other effects are likely to play a role. The 
two SSP tracks, indicated by the dashed lines, show that 
both age and metallicity differences may be responsible 
for the spread and the tilt. Also the redshift spread of the 
red sequence galaxies, corresponding to 0.5 Gyr, will in- 
duce scatter in colors. Finally, reddening by dust moves 
a galaxy in a similar direction as aging and metallicity, 
and may also contribute to the spread and the tilt of the 
red sequence. The importance of the different processes 
can not be addressed with the current data, and indepen- 
dent dust, age, and metallicity constraints are required 
to break the degeneracies. 



Finally, we stress that, although the star formation 
activity in the red-sequence galaxies appears low, the 
galaxies may still be reddened by fair amounts of dust. 
Best-fit stellar population models indicate an average 
dust content of Ay = 0.8 mag (using the Calzetti redden- 
ing law). However, due to degeneracies between age and 
dust, Av is poorly constraine d with typical uncertainties 
of 0.5 mag (jKriek et al.ll2008[ ). and dust-free models pro- 
vide a lmost equally good fits to the spectra (jKriek et al.l 
l2006bf) . Metallicity {Z) further complicates this degen- 
eracy, and as it is fixed to Zq during fitting, the lack 
of appropriate metallicities may have been compensated 
by adjusting age or Ay. Independent indicators, such as 
mid-infrared (MIR) imaging or Balmer decrements are 
needed to better constrain the dust content in these red- 
sequence galaxies. MIR imaging also reveals whether 
obscured star-forming regions may have be en missed. In 
this context it is interestin g to n ote that iReddv et al.l 
(|2006aD and lPapovich et al.ll2006D show that about half 
of the galaxies at the high mass-end of the galaxy dis- 
tribu tion at z ^ 2 . 3 are n ot detected at 24/im. More- 
over, iReddv et al.l (|2006af ) find that the Ha luminosity 
for z '^ 2 galaxies tracks the bolometric luminosity very 
well. Finally, large amounts of dust in all red-sequence 
galaxies studied in this work seems unlikely, because it 
would then be hard to explain the narrow peak in the 
observed color distribution. 

4. MEASURING THE EVOLUTION OF THE RED 
SEQUENCE 

In order to measure the rest-frame U — B evolution of 
the red sequence, we compare our z ~ 2.3 results with 
those of lower redshift samples in this section. We use 
two spectroscopic massive galaxy samples (> 10^ ^Mq) 
at 0.02 < z < 0.045 and 0.6 < z < 1.0, extracted 



from the Sloan Digital Sky Survey (SPSS. lYork et al 



2QO0), Data Release 5 (DR5; lAdelman-McCarthv etaL 



2007) and the Great Observator ies Origins Deep Survey 
(GOODS. [Giavalisco et aLll2004[ ). respectively. 

4.1. Spectroscopic Samples at Lower Redshifts 

For our lowest redshift sample we use a complete, 
mass-selected, volume-limited sample of galaxies at red- 
shifts 0.02 < z < 0.045, extracted from the SDSS. See 
Ivan der Wei et aLl (|2007f) for more details about the ex- 
traction and completeness of the sample. The SDSS u~g 
and g — r colors are used to derive rest-frame U — B col- 
ors^. M/L are derived from the g — r colors (corrected 
for galactic extinct ion and redshift) using the relation 
bv lBell et al.l (|2003f ). The inferred stellar masses are in- 
creased by 0.15 dex to account for differences in the IMF. 
The final massive (> IO-'^^M^q) galaxy sample consists of 
903 galaxies. 

For the intermediate redshift sample at 0.6 < z < 1.0, 
we use a mass-selected, volume-limited galaxy sample, 
constructed from GOODS-south. A detailed descrip- 
tion of the extr action of the samp le and completeness 
can be found in Ivan der Wei et al.l (J2007). Rest-frame 
U — B and B — V colors for the 0.6 < z < 1.0 sam- 
ple are derived from the F606W, I775W and F850LP 

* The AB to Vega zcropoint conversion for the U , B, and V 
ba nds used in this paper arc slightly different from those adopted 
in Ivan der Wei et al.i (i2007i). The conversions use d in this paper 
are the same as used bv lBruzual fc CharlotI II2003I ). 
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ACS photometry. Subsequently, B — V provides us 
with stellar ma sses, using the empirical relations by 
iBell et aLl (|2003f ). Again, the stellar masses are increased 
by 0.15 dex. Of this sample, 137 galaxies have masses 
> 10-^^ Mq. S pectroscopic redshifts are known for 70% 
of this sample (iLe Fevre et al.ll2004t iMignoli et al.l l2005l: 
Ivan der Wei et all l2005t IVanzella et all |2006[ ). As this 
spectroscopic sample is not fully representative of the 
total sam ple, we include th e galaxies with photometric 
redshifts (jWuvts et al.ll2008[ ) when appropriate. 

4.2. The Color Evolution of the Red Sequence 

For an accurate measurement of the U — B color evolu- 
tion, it is crucial to derive the color of the red sequence 
in a similar fashion as for our z ~ 2.3 sample. In the 
top panels of Figure [4] we show rest-frame U — B versus 
stellar mass for all three massive galaxies samples. The 
colors are corrected for redshift differences within each 
subsample, in a self-consistent way, by using the evolu- 
tion of the red-sequence color with time derived below. 
Next, we subtract equation ^ from the data and de- 
termine the peak of the color distribution in the same 
way as was done for the z ~ 2.34 sample. The extracted 
color distribution along the z ~ 0.0 slope is presented 
in the lower panels of Figure ID Galaxies with colors 
{U — B)m,z > {U — B)pcak — 0.1 are defined as belonging 
to the red sequence. The peak locations are indicated by 
solid lines in Figure ID For the 0.6 < z < 1.0 galaxies 
we exclude the 30% without spectroscopic redshifts when 
deriving the peak of the red sequence, as these galaxies 
have less accurate rest-frame colors. 

Random errors on the location of the red sequence are 
determined using bootstrapping, and are represented by 
the dotted lines in the lower panels of Figure ID The 
use of different samples and different method to derive 
rest-frame colors and stellar masses may introduce addi- 
tional systematic errors. In order to correctly interpret 
the observed evolution, we examine the following effects. 

- Aperture differences Elliptical galaxies exhibit color 
gradients (e.g., Franx & lUingworth 1990). Thus, differ- 
ences in the apertures that were used to measure rest- 
frame U — B may introduce systematics in the color evo- 
lution. For both the z ^ 0.03 and z ~ 0.73 sample 
the rest-frame U — B colors are derived for an aperture 
which is about equivalent to the half-light radius. For the 
z ~ 2.3 sample we use the NIR spectra to measure rest- 
frame U — B. The aperture sizes are rectangular and de- 
pend on the slitwidth (of 0'.'675), the extraction aperture, 
and extraction method. We use a weighted (with S/N) 
extraction and include spatial elements that have a flux 
large r than 0.25 times the maximum flux (jKriek et al.l 
l2008f ). Thus, the total aperture size is dependent on the 
light distribution of the galaxy. The average effective 
aperture of the z ~ 2.34 sample is comparable to a cir- 
cular aperture of ~ 8 kpc. 

In order to quantify possible systematics in rest-frame 
U ~ B determinations of red-sequence galaxies between 
the z < 1 and z ^ 2.3 samples, we measure rest-frame 
U — B ioT all galaxies in the z ~ 0.73 sample using an 
aperture of 1"(~7.6 kpc at z ~ 0.73) instead of 0'.'5. This 
aperture size is comparable to that used for the z ~ 2.3 
sample. For the increased aperture the U — B color of 
the z '^ 0.73 red sequence is 0.01 mag bluer. In order 



to correct for this effect, we have reduced the colors of 
the 0.02 < z < 0.045 and 0.6 < z < 1.0 galaxies by 0.01 
mag. 

- Zero points Uncertainties in zero points directly 
result in systematic uncertainties in the derived rest- 
frame colors. The uncertainty on individual zero points 
is < 0.01 mag for both the 0.02 < z < 0.045 ^ and 
0.6 < z < 1.0 (Sir ianni et"aLll2005D samples. Thus, the 
resulting uncertainty on U — B is < 0.02 mag. For the 
z ^ 2.3 sample we use AVO-type stars for calibrating 
the spectra. This results in an uncertainty on rest-frame 
U — B oi ^ 0.03 mag. For this sample the broadband 
photometry provides us with an indep endent check on 
the zero points. In iKriek et al.l ()2008[ ) we flnd a sys- 
tematic offset of ^ 0.01 mag between rest-frame U — V 
derived from the photometry (using Zgpoc) and the NIR 
spectra. This shows that the zero points probably intro- 
duce no large systematics for the z ~ 2.3 sample. 

- Rest-frame color determinations The determination 
of rest-frame colors may also result in systematic errors. 
For the low and intermediate redshift samples we use the 
photometry in combination with spectroscopic redshifts. 
This method may result in systematics of ~ 0.01 mag and 
- 0.03 mag for the 0.02 < z < 0.045 and 0.6 < z < 1.0 
samples, respectively. For the z '^ 2.3 sample we have 
higher resolution spectral shapes, provided by the NIR 
spectra. In the previous paragraph and in § [2] we dis- 
cussed two tests to assess rest-frame color determina- 
tions, and neither of them exhibit signiflcant systematics 
(- 0.01 mag). 

- Stellar mass determinations Although the stellar 
masses are all determined for the same IMF, the differ- 
ent methods used for the z < 1 and z ~ 2.3 samples 
may have introduced systematic errors. We tested this 
by determining the stellar masses for the z ^ 2.3 galaxies 
using the same method as for the lower redshift samples. 
This yielded stellar masses which are typically 0.13 dex 
less than the best-flt stellar masses. This may be due 
to younger and dustier stellar populations of the z ~ 2.3 
massive galaxies. As the slope of the red sequence is 0.08 
mag dex~^, this may result in a systematic error of ^0.01 
mag on the observed color evolution. Furthermore, this 
effect may also alter the cut-off value of lO^^M©, and con- 
sequently the selection of massive red-sequence galaxies. 
Increasing the mass cut-off value by 0.13 dex would not 
change the peak location of the red sequence. However, 
the number and mass fractions of massive galaxies on the 
red-sequence would increase by ~10%. 

- Completeness The z '^ 0.03 sample is volume lim- 
ited, and thus completeness effects are not expected to 
play a role. For the z ~ 0.73 sample we use just the spec- 
troscopic subsample to estimate the red-sequence color, 
as photometric redshifts results in less accurate rest- 
frame colors. However, including all galaxies, would give 
a rest-frame U — B color which is 0.01 mag less. Thus, 
completeness effects may result in a systematic error of 
~ 0.01 for the z ~ 0.73 sample. 

The z ~ 2.34 sample i s much smaller and not volume 
limited. In iKriek et al.l ()2008l ) we flnd that the distri- 
butions of rest-frame U — V color, observed R — K and 
J — K color and redshift for the spectroscopic sample 

^ http://www.sdss.org/ 
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Fig. 4. — Top panels: Rest-frame U — B color versus stellar mass for the three galaxy samples. The two lower redshift samples are 
adopted from Ivan der Wei et al.| (12007). Galaxies without spectroscopic redshifts in the 0.6 < z < 1.0 sample are indicated in gray. The 
U — B colors are corrected for redshift differences within the sample using equation l[2]l. Bottom panels: The color distribution extracted 
along the z ~ 0.03 slope (0.08 mag dex"'^). The peak of the red sequence is represented by the solid gray and black lines in the top and 
bottom panels respectively. All galaxies above the gray dotted lines in the top panels are defined as red-sequence galaxies. The dotted and 
dashed lines in the bottom panels indicate random and total Icr uncertainties, respectively 



at 2 < z < 3 are similar as for a photometric mass- 
and volume-limited sample at the same redshift inter- 
val. Unfortunately, we can only compare photometric 
properties in order to investigate whether the subsample 
is representative, and thus systematics in photometric 
studies m ay jeopardiz e the r eal completeness. For ex- 
ample, in iKriek et al.l ([2008') we identified systematics 
between photometric redshift and SED type, such that 
dusty, young galaxies were generally placed at too high 
redshifts. As these dusty galaxies scattered to lower red- 
shift, they are not included in the sample used in this 



work. Dusty galaxies with 2 < 



< 3 may not be 



properly represented in the sample, as they were ini- 
tially placed at too high redshift. Although this possible 
incompleteness may not alter our findings of the color 
of the red sequence, it should be kept in mind that the 
sample may not be complete and representative of the 
total population of 2 < z < 3 galaxies. We estimate that 
completeness effects may result in a systematic error of 
^ 0.03 mag on the rest-frame U — B color of the z ^ 2.3 
red sequence. 

The various effects discussed above result in total sys- 
tematic uncertainties of 0.03, 0.04 and 0.05 mag for the 
z ~ 0.0, z ~ 0.7 and z ~ 2.3 red-sequence colors re- 
spectively. We assume that the random and systematic 



errors are independent and can be added in quadrature. 
The color evolution of massive galaxies is shown in Fig- 
ure [5] The small filled dots show individual red-sequence 
galaxies, and the large filled symbols with errorbars show 
the peak locations for galaxies on the red sequence. The 
dotted line is a simple linear fit to the large symbols, of 
the form: 

(C/-B)^ =0.19 + 0.33 t (2) 

with t the fractional age of the universe. The fit was 
used to apply differential color corrections to account for 
redshift differences within each sample (see above), and 
provides a remarkably good fit. 

Table [T] lists the rest-frame U — B color and the width 
of the red sequence for the three samples. The color of 
the red sequence evolves by ~0.25 mag between between 
z ~ 2.3 and z r^ 0.0. In §[5] we attempt to explain this 
evolution using simple models. 

4.3. Evolution of the Mass and Number Density 

In addition to A(C/ — S), the evolution of the number 
and mass density of red-sequence galaxies places con- 
straints on the build up of the red sequence. The densi- 
ties for the z < 1 samples follow directly from the used 
samples, as both are volume limited. To obtain the rel- 
ative fractions at z '^ 0.73 we use the full 0.6 < z < 1.0 
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Fig. 5. — U — B color versus time for all galaxies in the three 
massive galaxy samples (> 10^^ Mq). The rest-frame U — B colors 
for the individual galaxies in this plot are corrected for the slope 
in the U — B vs. stellar mass relation (0.08 mag dex~^), and 
given for a stellar mass of 2 X 10^^ Mq. The red-sequence galaxies 
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The gray dotted line represents the best linear fit through the red- 
sequence locations. This relation is used to correct U — B colors 
for redshift differences within the samples. 
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sample, including galaxies with photonietric redshifts, in 
combination with the red-sequence location as derived 
from the spectroscopic sample. 

The galaxies at 2 < z < 3 do not form a com- 
plete sample. We use the MUSYC deep s urvey (1030, 
1256 and HDF-South, iQuadri et aP l2007t l to estimate 
the total number of massive galaxies (> IO^^Mq) at 
2 < z < 3 (see § [5]). Note that most galaxies of our 
spectroscopic sample are extracted from this survey. We 
correct the obtained densit y for systema tics in photomet- 
ric redshifts as derived in iKriek et al.l (J2008.1 . We find 
a total number density of massive galaxies of p {M > 



10"A/o) = 2.0 X 10-4 Mpc -^. This value is consis - 
tent with the values found by Ivan Dokkum et all (|2006l ) 
of 2.2J:°;^ X 10-'* Mpc-3 and by iDrorv et al.l (|2005^ of 
~ 1.8 X IQ-^ Mpc-"^ for the same mass-cut and IMF. 
Subsequently we use the mass and number fractions, as 
estimated from the spectroscopic sample to derive the 
mass and number densities of red-sequence galaxies. We 
find a number density of massive red-sequence galaxies 
at zr^ 2.3 of l.ltol X 10"" Mpc-^. 

All number and mass densities are given in Table [H 
The uncertainties on the fractions are determined by 
bootstrapping (see ii l4.2p . For the number and mass den- 
sities, the uncertainties include cosmic variance, random 
errors, uncertainties on the fractions, and uncertainties 
introduced by different stellar mass estimators. The frac- 
tion of the total massive galaxy population that is on the 
red sequence has evolved by only ~ 20% from z ~ 2.3 
to the present. Similarly, the fraction of the total stel- 
lar mass in galaxies with M > IQ^^Mq that is on the 
red sequence has evolved by ~ 15%. However, the total 
number and total mass of red sequence galaxies has in- 
creased by factors of ~ 8 and ~ 6 respectively over this 
time interval. 

4.4. Comparison with Other Studies 

In this section we discuss our results in the context of 
other studies that investigated the evolution of the red se- 
quence. However, when comparing our findings to these 
studies, it is important to keep in mind the difference in 
sam ple selection and red -sequ ence definition. For e xam- 
ple. (Shaplev eTaH (l200l and iReddv eFaH (|2006bD find 
no red-sequence galaxies among their z ~ 2 Lyman Break 
Galaxies (LBGs lSteidel et al]ll996al lbl). as the galaxies 
are selected by their bright optical flux, and thus they 
only target star-forming galaxies. Only 20% of a mass- 
limited sample would be identified by LEG selection, and 
thus this selection technique is not suitable for obtain- 
ing u nbiased mass-limited samples (|van Dokkum et all 

Mm- 

Using a mass-limited sample with a wide variety of 
data over a total field of 1.53 deg^, IConselice et al.l 
(|2007f ) find that the red-sequence fraction of galaxies with 
IO^Mq < M, < 10"-^Mq increases from 0.65 to 0.88 
between z ^ 1.4 and z ~ 0.4. Direct comparison with our 
work is complicated by the di fferent mass range, a dif- 
ferent adopted IMF ()Chabrieill2003f ). and different color 
criteria to identify red-sequence galaxies. Nevertheless, 
the slight decrease of this fraction may be consistent with 
the findings presented in this work. 

Our study is also broadly consistent with the results of 
the K20 survey. iFontana et all (|2004f ) find that ~30- 
40% of the present day stellar mass in objects with 
5 X 1O*°M0 < Af* < 5 X 1O"M0 appears to be in place 
at z ~ 2. We find that for M* > 1 x lO^A/© 20% 
of the mass was already in place at a slightly higher 
redshift of z '^ 2.3. Given the large uncertainties of 
both studies, the different mass cuts, and the different 
targeted redshift range, these results are not in conflict 
with each other. Their relative fraction of red to blue 
sequence galaxies is difficult to compare to ours, as K20 
has spectroscopic redshifts and classiflcations for 43% of 
the galaxies at 1.3 < z < 2.0. 21% are early types and 
22% are late types. So for more than half of the sample 
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the nature of the galaxy is unknown. As late types are 
easier to spectroscopically confirm (especially with opti- 
cal spectroscopy) than early types, it is not unlikely that 
the early types span the majority at this redshift range. 
Thus, their result is not inconsistent with the fractions 
w e find for our sp e ctrosc opic sample at z~2.3. 

ICirasuolo et al.l (|2007f ) find that less than half of the 
galaxies exceed their red-sequence cut. However, these 
authors use a i^-selected, instead of a mass-limited sam- 
ple. iiT-selection gives higher fractions of blue galaxies, 
as such galaxies are brighter in K than red galaxies at 
the same stellar mass. Also, since their color distribu- 
tion shows no obvious peak, their definition of the red- 
sequence galaxies canno t be di rectly compared to ours. 

Finallv. lCassata et al.l (|2008D identify color bimodality 
out to z=2 among the galaxies in the GMASS survey. 
Their sample has a spectroscopic completeness of 50%, 
with 190 spectroscopic redshifts beyond z=1.4. The au- 
thors show that early-type galaxies make up 50% of the 
population at the high mass end at z = 2. When impos- 
ing the same color and mass cut as used in this paper, we 
derive that red-sequence galaxies would dominate. Fur- 
thermore, in agreement with our study, they find that 
the rest-frame U — B color evolves by only ~ 0.2 mag 
between z r^ 2.5 and z '^ 0.5. 

In summary, it is difficult to directly compare our re- 
sults with other studies, as few in the targeted redshift 
range use mass limited samples. Even more importantly, 
as already discussed in § 3.1, the lack of spectroscopic 
information prevents accurate rest-frame color determi- 
nations of red galaxies for current z > 2 studies. Nev- 
ertheless, our study seems broadly consistent with other 
work. 

5. MODELING THE EVOLUTION OF THE RED SEQUENCE 

In the previous section we derived the evolution of rest- 
frame U — B color and the number density for the massive 
end (> IO^^Mq) of the red sequence between z ~ 2.3 and 
the present. In this section we attempt to explain this 
evolution using simple models. 

5.1. Aging of Stellar Populations 

The red-sequence galaxies at z ~ 2.3 have no or 
very little ongoing star formation and thus are expected 
to evolve passively over time. Therefore, simple aging 
of stellar populations seems the most straightforward 
explanation for the observed color evolution. In Fig- 
ure [6K we compare the measured U — B color evolu- 
tion to single burs t , sola r me tallicity models by both 
iBruzual fc CharlotI (|2003h and iMarastoii (|2005[ ). In § |3] 
we determine that the 2; ~ 2.3 galaxies are in a post- 
starburst phase, with typical ages of ~0.5 and ^1 Gyr 
for dust and dust-free models, respectively (jKriek et al.l 
l2006bl[2008l ). Therefore, we assume a formation redshift 
of the stars of Zform = 3.6. For comparison we also give a 
subsolar model (0.4Zq) and a model with infinite forma- 
tion redshift, both from the IBruzual fc CharlotI ()2003l ) 
library. 

Figure [6^ shows that passive stellar population mod- 
els with Zform = 3.6 and solar metallicity predict a too 
strong color evolution, and produce galaxies at z ^ 
that are too red. Only sub-solar models can explain the 
observed evolution at 2 x 10^^ Af©. However, this is in 
contradiction with studies of local ellipticals with similar 



stellar ma sses, which imply sol ar to supersolar metallic- 
ities (e.g.. lWorthev et al.l[l99l l. 

Perhaps more importantly, passive evolution models 
do not predict any evolution in the density of galaxies on 
the red sequence, in clear confiict with the observations 
(see solid curve in Fig. [7|). Therefore, the evolution of 
the red sequence is more complicated than just aging. In 
the following sections we discuss different processes that 
may have depressed the color evolution. 

5.2. Growth of the Red Sequence by Transformations 

Not all galaxies stop forming stars at high redshift, 
and the red sequence will be constantly supplemented 
by newly quenched galaxies. Galaxies that move to the 
red sequence at later times will be bluer than the galax- 
ies that already reside on the red sequence, and subse- 
quently will fiatten the total color evolu tion. This effect 
is kn own as the progenitor bias (van D okkum fc Frama 
|2Q01D . Qualitatively, such models can reduce the appar- 
ent color evolution while there is rapid density evolution 
and a constant influx of relatively young galaxies to the 
red sequence. 

We apply a simple model in order to test whether in- 
cluding progenitor bias can explain the observed evolu- 
tion. We assume a formation period of 100 Myr around 
•Zform in which all galaxies are formed. The individual 
galaxies are quenched at tstop; following the probability 
distribution of t= 



stop- 



P{tstop) OC exp(-t/Tstop) 



(3) 



We use IBruzual fc Chariot (|2003f ) models to construct 
the composite evolution from the color evolution of in- 
dividual galaxies. We simulate 10 000 galaxies and de- 
termine the red-sequence color using the same method 
as used for the data (see § 14. 2p . In contrast to 
Ivan Dokkum fc Franxl (|2001h . we assume a constant SFR 
between istart and tstop- 

In Figure [Bb we examine whether the supply of newly 
quenched galaxies can explain the observed U — B evolu- 
tion. The dotted curve represents the evolution for an ex- 
ponentially declining quenching model (rgtop = IGyr). 
The formation time has been set such that, consistent 
with the observations, the 2; ~ 2.3 red-sequence galaxies 
have stopped forming stars for ~1 Gyr (zform = 4.7). 
This model reduces A{U — B) compared to just aging 
(solid line), but matches neither the observed color nor 
the number density evolution (see Fig. [7]). 

One way to further decline the color evolution and bet- 
ter match the observed number density evolution, is in- 
creasing Tstop to infinity. However, a more realistic model 
would be to vary istart in addition to igtop- We define 
the probability distribution of the formation time of the 
galaxies as 



P{tstait) OC exp(-t/Ts< 



(4) 



The dashed curve in Figure[6lD represents the model with 
exponentially declining formation and quenching rates 
(■^start — 1 Gyr and Tgtop = 1 Gyr). Also for this model 
we assume a minimum formation redshift such that on 
average red-sequence galaxies at z ^ 2.3 have quenched 
their star formation 1 Gyr ago (zform — 12). This model 
provides a better fit to A(C/ — B) than the previous two 
models, and matches the evolution of the number density 
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Fig. 6. — Comparison of the observed rest-frame U — B color evolution {black filled circles) of the red sequence (at 2 X IO^^A/q) with 
simple models. In panel (a) we examine whether the observati ons are consistent w ith passive e volution. The so lid and dashed-dotted lines 
show the rest-frame U — B color evolution for SSP models bv lBruzual &: CharlotI ||200.3) and Marastonl l|2005l ). respectively. Both tracks 
assume solar metallicity and Zform = 3.6. For c omparison we also show a n SSP model for an infinite formation redshift (dotted curve), 
and subsolar metallicity (dashed curve), both bv lBruzual He CharlotI {2003'). The evolution is reasonably well-fitted by an SSP model with 
subsolar metallicity. However, this is irreconcilable with current studies of the metallicity of massive elliptical galaxies. In panel (b) we 
examine more complicated models. The dotted curve represents a model in which the red sequence grows by newly quenched galaxies. The 
galaxies have the same iatart , but a different burst length. In the model represented by the dashed line we vary istart as well. In the model 
represented by the dashed-dotted line the evolution is further flattened due to red mergers. The combination of all effects provides a good 
fit to the observed data. 
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Fig. 7. — The evolution of the number density of red-sequence 
galaxies more massive than 10^^ Mq (for a Salpeter IMF). The evo- 
lution tracks correspond to the models in Figure [BJj. Simple aging 
models (solid curve) do not match the evolution. The observed evo- 
lution is well fitted by models with exponentially declining tatart 
and (stop (dashed curve: Tatart = 1 Gyr and Tatop = IGyr). Also 
models with a large value for Tatop and a small value for Tfaim, or 
the other way around, would provide good fits. The effect of red 
mergers on the number density evolution are ignored in this Figure. 



of massive red-sequence galaxies (see Fig. [7])- Nonethe- 
less, the predicted color evolution for 2 x IO^^Mq red- 
sequence galaxies is still too slow, and other processes 
may be needed to match the color evolution. We note, 
howev er, that the stellar population models by Marasto^ 
(|2005[ ) would have provided a slightly better fit to the ob- 
served color evolution. 

5.3. Red Mergers 

Mergers on t he red sequence ma y also alter the color 
evolution (e.g.. iBower et al.]|1998f ). Assuming that no 
star-formation is triggered or other major processes take 
place, merging of two red sequence galaxies increases the 
stellar mass, but leave the color unchanged. Thus red 
galaxy mergers shift the red sequence to higher masses, 
and consequently reduce the color evolution when mea- 
sured at fixed mass. 

The dashed-dotted curve in Figure [HJd represents the 
color evolution when including red mergers, in addition 
to aging and a varying tgtart and istop- We assume a 
constant merger rate, normalized such that red-sequence 
galaxies experience one equal mass merger between z '^ 1 
and the present. Figure [Gla shows that the combined 
model provides a reasonable fit to the observed evolution. 
If we ignore galaxy transformations, we need 5-7 major 
mergers between z ~ 2.3 and the present to provide a 
good fit to the observed color evolution. 

Observational evidence for red mergers (e.g. , 
Ivan DokkunJ [20051 iTran et all [20051 : iBell et all [2006h 
validates this explanation. However, the corresponding 
decline in the rest-frame U — B color evolution in this 
study might be overestimated. First, our assumed 
merger rates between z ~ 1 and ^ may be too high. 
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as accurate observational constraints are still lacking. 
Furthermore, not all major mergers are equal mass 
mergers, and for example a 3:1 merger (A[C/ — i3]=0.020 
mag) has less impact on the evolution than a 1 : 1 merger 
{A[U - B]=0.024 mag). 

The effects of red mergers on the number density evo- 
lution are difficult to estimate using the simple models 
presented in this work. The merger rate may be depen- 
dent on mass, and accurate mass functions are needed 
to understan d the growth of the red sequence due to red 
mergers (see lBundv et al.ll2007[ ). 

5.4. Other Influences 

Although a combination of aging, quenching, and red 
mergers provides a good fit to observed evolution, there 
may be other possible explanation as well. One con- 
cern is the unknown dust content of red-sequence galax- 
ies, especially at z ~ 2.3. The well-defined shape of the 
z ~ 2.3 red-sequence makes large dust contents implausi- 
ble. However, as these galaxies recently stopped forming 
stars, they may still be in the process of losing their dust. 
The red-sequence galaxies have a median best-fit Ay of 
0.8 mag, but the constraints are poor with typical la er- 
rors of 0.5 mag. Furthermore, as explained in ii l3.2[ the 
degeneracy with metallicity may introduce an additional 
systematic error. Nonetheless, an Ay of 0.8 mag would 
lower the z ~ 2.3 red-sequence color in Figure by 0.16 
mag (for a lCalzetti et al.|[2000l reddening law). The re- 
sulting evolution requires a later formation redshift, and 
this results in a slightly bluer color at low redshift. How- 
ever, this effect is very small, and thus dust cannot be 
primarily responsible for the slow observed color evolu- 
tion. 

New starbursts in red-sequence galaxies, for example 
triggered by m e rgers, may reduce the color evolution. 
iBirnboim et al.l (|2007f ) suggest that quenched galaxies 
may undergo a second starburst due to gas accretion. 
This starburst moves the galaxies back to the blue cloud. 
Once the star formation is quenched for the second time, 
the galaxy moves again to the red sequence. Compared 
to just passive evolution, the galaxy will be bluer due to 
younger ages of the newly formed stars. In th is context 
it is interesting to note that lLabbe et al.l ()2007[ ) find that 
episodic star forming models provide the best explana- 
tion of the evolution of the blue sequence. Also, low-level 
residual star formation - or "frosting" of younger stars to 
an older "base" population - may depress the color evo- 
lution and could explain the apparently too blue colors of 
the l ow-redshift red-sequence galaxies (see iTrager et al.l 
I2OOOD . However, only ~15% of the low-redshift field el- 
liptic als show evide nce for fairly recent star formation 
(e.g.. lYi et al.l[2005h . and thus this effect can not fully 
explain the slow color evolution. 

An evolving IMF or metallicity may also alter the rest- 
frame U — B evolution of the red sequence. For exam- 
ple, in case metallicity is lower for galaxies that form or 
quench at later times, A(C/ — i?) will be slower. Further- 
more, several authors have suggested that the IMF of 
massive ell iptical galaxies may be top-heavy or "bottom 
light" (see Ivan DokkumI l2008al and references therein) . 
Such IMF s lead to slow er color evolution than Salpeter- 
hke IMFs (Tinslev 1980), although the effect is relatively 
small (van Dokkum.2008a.l . 

Finally, we note that the used stellar population mod- 



els (jBruzual fc CharlotI [200I iMarastonI I2005D might be 
incomplete, and the evolution in the models may be too 
strong. 

6. SUMMARY 

Our recent discovery of galaxies with quiescent stellar 
populations beyond z = 2 suggests that a red sequence 
is already in place at these redshifts. We examined this 
suggestion using our NIR spectroscopic survey of mas- 
sive galaxies at 2 < z < 3. The combination of spectro- 
scopic redshifts and detailed continuum shapes as pro- 
vided by the NIR spectra, allows the first accurate rest- 
frame color and stellar mass determinations for a massive 
galaxy sample beyond z = 2. 

The distribution of galaxies in the rest-frame U ~ B 
color versus mass diagram demonstrates the existence of 
a red sequence at z ~ 2.3, with a significance of > 3ct. 
The red sequence hosts ~ 60% of the stellar mass at the 
high mass end (> lO^M©) at 2 ~ 2.3. We study the stel- 
lar populations of the red-sequence galaxies using emis- 
sion line diagnostics, and stellar population modeling. 
The stellar continua, as provided by the NIR spectra, of 
nearly all red-sequence galaxies are best-fit by specific 
SFRs less than 0.05 Gyr~^. Furthermore, in contrast 
to the blue galaxies, they have no detected rest-frame 
optical emission lines (e.g.. Ha), or the line emission is 
dominated by AGN activity. Thus, both independent di- 
agnostics imply that the red sequence is dominated by 
galaxies with quiescent stellar populations. 

By combining rest-frame U — B with B — V , we find 
that the z ~ 2.3 red-sequence galaxies are in a post- 
starburst phase, with typical ages of ~ 0.5 — 1 Gyr. This 
finding is supported by the strong Balmer break in the 
stacked spectrum of all red-sequence galaxies. Overall, 
this implies that the red sequence is primarily driven by 
post-starburst galaxies at this epoch, and probably has 
just started to build up at z ~ 2.3. 

We study the rest-frame U — B color evolution of mas- 
sive galaxies by comparing our sample with spectroscopic 
galaxy samples at z ~ 0.03 and z ~ 0.73. Remarkably, 
rest-frame U — B evolves slowly, by only ~ 0.25 mag be- 
tween z ~ 2.3 and the present. The fraction of massive 
galaxies (> 10^^ Mq) on the red sequence increases by 
only ^ 20% between z ~ 2.3 and z ~ 0.0. Similarly, the 
fraction of the total stellar mass of massive galaxies on 
the red-sequence increases by only ^ 15%. However, the 
number and mass density of the massive (> IO^^Mq) 
red-sequence galaxies grow by factors of ^ 8 and ~ 6, 
respectively, over the same redshift interval. 

Overall, we show that the slow color evolution of the 
red sequence does not allow a straightforward explana- 
tion. Simple aging models predict a too strong color evo- 
lution, and consequently the red-sequence galaxies are 
too red at z '^ 0. Also, such models cannot reproduce 
the strong density evolution of galaxies on the red se- 
quence that we measure here. Presumably, the evolution 
is a combination of aging, galaxy transformations and 
red mergers. Furthermore, frosting of young stars, recent 
starbursts, dust, and an evolving IMF may also play a 
minor role in the evolution of the red sequence, but this 
remains to be explored. 

More accurate constraints from the color evolution re- 
quire independent measurements of the metallicity, dust 
content, ages, current SFRs, and IMF of red-sequence 
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galaxies over all epochs. As for the moment, the evo- 
lution of the number density and mass function provide 
the most powerful method to study the growth of the 
red sequence. The slope and spread {au-s) can also be 
used to set further constraints on its build-up. However, 
due to the significant errors on the rest-frame colors and 
the small size of the z ^ 2.3 spectroscopic sample, we 
use neither in this work. We do note that none of the 
models discussed here violate the constraints imposed by 
the observed scatter. 

Finally, we would like to note that our findings are 
supported by recent morphological studies. Using follow 
up high-resolution N1C2 imaging on HST and adaptive 
optics i maging with N1RC2 on Keck. Ivan Dokkum et al.l 
(|2008bf) find very compact sizes and extremely high in- 
ferred stellar mass densities for the red-sequence galax- 
ies in this paper (see also iTruiillo et al.l [20061 l2007t 



Toft etal.1 120071 : IZirm et all 120071 : [Longhetti et all 120071 : 
Cimattil 120081 ). This finding implies that these galax- 
ies do not passively evolve into the red-sequence galaxies 
in the local universe. Thus, both the morphological and 
color evolution require active evolution to take place that 
transform the z ~ 2.3 red-sequence galaxies to those at 
the current epoch. 
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